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SPECIFICATION 

METHOD AND SYSTEM FOR SOLVING 
SATISFIABILITY PROBLEMS 

CROSS-REFERENCE TO RELATED APPLICATIONS 

[0001] The present application claims the benefit of U.S. Provisional Application Serial No. 
60/450,940, filed February 28, 2003, which is hereby incorporated by reference in its entirety for 
all purposes as if fully set forth herein. 

BACKGROUND AND SUMMARY 

[0002] The present invention is related to satisfiability problems. More particularly, the present 
invention is directed to a method and system for solving satisfiability problems. 

[0003] The satisfiability (SAT) problem consists of determining a satisfying variable 
assignment, V, for a function (or formula),/, or determining that no such V exists. The function 
/may be expressed in conjunctive normal form (CNF), which consists of the logical AND (i.e., 
conjunction) of one or more clauses. Each clause consists of the logical OR (i.e., disjunction) of 
one or more literals. A literal is a Boolean function of one argument or variable. Each variable x 
has a positive literal, i.e., the identity function of x, denoted as x, and a negative literal, i.e., the 
negation function of x, denoted as ~x. 

[0004] One solution to the satisfiability problem is to search through all possible assignments. 
However, with n variables, there are 2" possible assignments. Therefore, as the number of 
variables grows, the number of possible assignments to search through increases exponentially. 
This can make finding a solution very difficult. Since many problems in synthesis, automatic 
test pattern generation (ATPG), verification, artificial intelligence, cryptology, and other fields 
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reduce to SAT problems, it is desirable to discover new methods and systems for solving SAT 
problems efficiently. 

[0005] Some of the SAT-algorithms that have been developed include GRASP, SATO, and 
Chaff, which are described in J. P. M. Silva and K. A. Sakallah, GRASP: A Search Algorithm for 
5 Propositional Satisfiability, IEEE Transactions of Computers, volume 48, pages 506-521, 1999, H. 
Zhang, SATO: An efficient propositional prover, Proceedings of the International Conference on 
Automated Deduction, pages 272-275, July 1997, and M. W. Moskewicz, C F. Madigan, Y. Zhao, 
L. Zhang, and S. Malik, Chaff: Engineering an Efficient SAT Solver, Proceeding of the 38th Design 
Automation Conference, 2001. All of these are search algorithms aimed at finding a satisfying 
10 assignment by variable splitting. Search algorithms of that kind are descendants of the Davis- 
Putnam-Loveland-Longemann (DPLL) algorithm, which is described in M. Davis, G. 
Longemann, and D. Loveland, A Machine program for theorem proving, Communications of the 
ACM, volume 5, pages 394-397, 1962. 

[0006] When looking for a solution, the DPLL-algorithm examines a search tree. This tree-like 
15 search can be considered as an implementation of the tree-like resolution proof system which is 
a special case of general resolution. It was shown in E.Ben-Sasson, R. Impagliazzo, and 
A. Wigderson, Near optimal separation of Treelike and General resolution, Proceedings of SAT-2000: 
Third Workshop on the Satisfiability Problem, pages 14-18, May 2000, that there is an 
exponential gap between the performance of tree-like resolution and that of general resolution. 

20 [0007] Search algorithms, such as GRASP, SATO, and Chaff, have made steps towards general 
resolution by trying to eliminate the drawbacks and limitations of pure tree-like resolution. For 
example, these SAT-solvers record conflict clauses, which are implicates of the original 
formula. Adding conflict clauses allows for pruning of many branches of the search tree that 
have yet to be examined. The deduced implicates are added to a clause database. 

25 [0008] In addition, some of the SAT-algorithms use the strategy of restarts. A restart is when 
the SAT-algorithm abandons the current search tree (without completing it) and starts a new 
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one. So instead of one complete search tree, a set of incomplete trees (except the last one) are 
constructed. In L. Baptista and J. P. Marques-Silva, The interplay of randomization and learning on 
real-world instances of satisfiability, Proceedings of AAAI Workshop on Leveraging Probability 
and Uncertainty in Computation, July 2000 and C. P. Gomes, B. Selman, and H. Kautz, Boosting 
5 combinational search through randomization, Proceedings of International Conference on Principles 
and Practice of Constraint Programming, 1997, the usefulness of restarts was proven 
experimentally. 

[0009] Although current SAT-solvers have made substantial progress towards general 
resolution, much still needs to be done to improve performance and robustness. For example, 

10 there are formulas that can be finished in a few seconds by one SAT-solver, but cannot be 

solved by another SAT-solver. This suggests that these SAT-solvers are not robust, e.g., these 
SAT-solvers either solve a formula quickly or not at all regardless of how much time they are 
given to find a solution. Additionally, current SAT-solvers are not flexible enough to take into 
account a formula's structure. In many cases, this results in poor performance and lack of 

15 robustness, e.g., substantial change in performance occurs after the formula is slightly changed. 

[0010] Some of the current SAT-solvers use the activity of variables in conflict making to single 
out good candidates for branching variables. They compute the activity of a variable by 
counting the number of occurrences of literals of each variable in conflict clauses. Separate 
counters are maintained for each literal. By only taking into account the occurrences of 

20 variables in conflict clauses, there may not be an accurate estimate of the activity of some 

variables. In particular, the variables that actively contribute to conflicts but do not appear in 
conflict clauses may be overlooked. For instance, if a variable is frequently assigned through 
deduction, it may drive many conflicts without appearing in conflict clauses. Therefore, if only 
conflict clauses are taken into account when computing the activity of the variable, this activity 

25 will be underestimated. 
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[0011] Additionally, some SAT-solvers periodically divide the activity counters by a small 
constant. This strategy may not be able to adjust quickly enough to changes in the set of 
relevant variables. For example, suppose activity counters are divided by 2 every 100 conflicts 
and suppose that immediately after dividing the counters, a change of relevant variables occurs. 
5 Then, for the next 20-30 conflicts, the choice of branching variables will be dominated by an 
obsolete set of active variables. 

[0012] Moreover, current SAT-solvers examine the whole set of variables each time they select 
a branching variable. This may be inefficient when there is a large number of variables in the 
set. Further, some of the current SAT-solvers remove conflict clauses based solely on the size 
10 (or length) of the conflict clause, i.e., the number of literals in the clause. However, some 

conflict clauses that have a sizeable number of literals may be more useful than conflict clauses 
with less literals. Consequently, removing conflict clauses simply based on size may be unwise. 
Other SAT-solvers remove conflict clauses whenever the memory is full. Not only is this 
inefficient, it may also lead to the removal of useful conflict clauses. 

15 [0013] The present invention provides a method and system for solving satisfiability problems. 
In one embodiment, a plurality of clauses in a satisfiability problem are organized as a 
chronologically ordered stack. The stack has a top and a bottom and newly deduced conflict 
clauses are added to the top of the stack. A branching variable is then selected from a plurality 
of unassigned variables in the CNF formula and the selected branching variable is assigned a 

20 value of 0 or 1. One or more clauses are marked as satisfied, and Boolean Constraint 

Propagation is invoked when the stack contains one or more unit clauses. The process of 
making value assignment to branching variables (accompanied by invoking Boolean Constraint 
Propagation) goes on until one of the following two conditions hold. If all the clauses of the 
stack are satisfied, the accumulated set of variable assignments is returned as a solution. The 

25 second condition is that a clause which is unsatisfiable under the current set of assignments is 
produced. In that case, a new clause is learned and added to the stack. After that, a subset of 
last assignments is undone and new assignments to branching variables followed by Boolean 
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Constraint Propagation are made. This goes on until an empty clause is derived (the initial 
formula is unsatisfiable) or a solution is found (the initial formula is satisfiable). 

[0014] Further details of aspects, objects, and advantages of the invention are described below 
in the detailed description, drawings, and claims. Both the foregoing general description and 
5 the following detailed description are exemplary and explanatory, and are not intended to be 
limiting as to the scope of the invention. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

[0015] The accompanying drawings are included to provide a further understanding of the 
invention and, together with the Detailed Description, serve to explain the principles of the 
invention. 

5 [0016] Fig. 1 illustrates a process flow of a method for solving satisfiability problems according 
to an embodiment of the invention. 

[0017] Fig. 2 depicts a computer system with which embodiments of the present invention can 
be implemented. 
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DETAILED DESCRIPTION 

[0018] Illustrated in Fig. 1 is a method of solving satisfiability problems according to an 
embodiment of the invention. At 102, a plurality of clauses of the original CNF formula are 
organized as a chronologically ordered stack. The stack has a top and a bottom and may 
5 include both initial and conflict clauses. Initial clauses are clauses of the original CNF formula. 
In some embodiments, the initial clauses are on the bottom of the stack and newly deduced 
conflict clauses are added to the top of the stack. 

[0019] A branching variable is selected from a plurality of unassigned variables of the current 
CNF formula at 104. In one embodiment, when the clause on top of the stack is a conflict 
10 clause, the branching variable is selected from among a set of unassigned variables in the top 
clause rather than from among all of the unassigned variables of the current CNF formula. 

[0020] At 106, a value of one or zero is assigned to the selected branching variable. In deciding 
whether to assign one or zero to the selected branching variable, costs associated with positive 
and negative literals of this variable may be taken into account. The cost of a literal, in some 

15 embodiments, is equal to the total number of conflict clauses containing the literal. The 

branching variable is assigned the value setting to one the literal with a higher associated cost. 
In other embodiments, the cost of a literal is equal to the total number of binary clauses 
containing the literal plus, for each binary clause C containing the literal, the total number of 
binary clauses containing the opposite of the other literal of the clause C . The branching 

20 variable is assigned the value setting to zero the literal with a higher associated cost. 

[0021] Some stack clauses get satisfied as a result of assigning a value to the selected branching 
variable. Besides, all literals of the selected branching variable of one polarity become equal to 
0. The satisfied clauses are marked and the literals set to zero are removed (108). Boolean 
Constraint Propagation is invoked (112) when the stack contains any unit clauses (110). 
25 Otherwise, the process returns to (104). If all clauses of the current formula are satisfied (114), 
the accumulated set of value assignments is returned as a solution (116). If no solution has been 
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found, a determination is made at 118 as to whether a conflict has arisen. A conflict arises when 
all of the literals in a clause are assigned to zero. If no conflicts have arisen, the process returns 
to 104. If a conflict has arisen, a conflict analysis procedure is invoked at 120 to deduce a new 
conflict clause. If the new conflict clause is empty, i.e. it does not contain literals (122), then the 
5 original CNF formula is proven to be unsatisfiable (124), and the process ends. Otherwise, the 
new conflict clause is added to the top of the stack (126). 

[0022] An activity counter is maintained for each variable and each conflict clause in the 
current CNF formula (128). The activity counter of a variable may be incremented each time the 
variable appears in a clause involved in a conflict. A clause C is involved in a conflict if during 

10 Boolean Constraint Propagation, the clause C becomes unit and a deduced assignment is made 
to satisfy C, which later leads to a conflict. The decision as to which branching variable is 
selected at 104 may be based upon the activity of the variable. For example, the selected 
branching variable may be the variable with the highest activity counter among all of the 
unassigned variables in the current CNF formula or among all of the unassigned variables in 

15 the top clause of the stack. The activity counters may be periodically divided by a constant 
greater than one. The last level assignments are undone, (130). 

[0023] At 132, a determination is made as to whether one of two restart conditions is met. One 
restart condition holds true when the number of conflict clauses added to the stack since the 
last restart has exceeded a threshold. In one embodiment, the threshold is 550 conflict clauses. 

20 The threshold may be less or more in other embodiments. The other restart condition is met 
when the number of unit conflict clauses of the current CNF formula exceeds a threshold 
value. If neither restart condition holds true, the process returns to 104. Otherwise, a restart 
takes place. All assignments except ones made to satisfy unit conflict clauses are undone, (134). 
After that, Boolean Constraint Propagation procedure is invoked and the clauses satisfied 

25 during this procedure are removed from the current formula. Besides, some clauses are 

removed from the clause database or stack (136). (In the following exposition the terms "clause 
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database" and "stack of clauses" denote different aspects of the same object which is the 
current CNF formula.) Then the process returns to 104. 

[0024] The conflict clauses removed from the stack during restart may be near the bottom of the 
stack, i.e., earlier deduced conflict clauses. In one embodiment, at least one of the "bottom" 
5 conflict clauses removed from the stack has more than eight literals. In another embodiment, at 
least one of the conflict clauses removed from the bottom of the stack has an activity counter 
value less than sixty. 

[0025] Conflict clauses removed from the stack may also be near the top of the stack, i.e., more 
recently deduced conflict clauses. In one embodiment, at least one of the "top" conflict clauses 
10 removed from the stack has more than forty-two literals. In another embodiment, at least one of 
the "top" conflict clauses removed from the stack has an activity counter value less than seven. 

[0026] In some embodiments, the top conflict clause in the stack is never removed. In other 
embodiments, no less than a specified part of the conflict clauses in the stack are removed. For 
example, no less than 6.25% of the conflict clauses in the stack are removed. 

15 BOOLEAN CONSTRAINT PROPAGATION (BCP) 

[0027] After choosing a branching variable (say variable y) and assigning it a value, the Boolean 
Constraint Propagation (BCP) procedure may be initiated. Any appropriate BCP procedure 
may be used. In one example embodiment, the BCP procedure is as follows. Suppose that y is 
assigned value 1. The negative literal of y, i.e., ~y, is set to 0. Setting the literal ~y to 0 may 

20 produce a unit clause, i.e., an unsatisfied clause in which all literals but one are set to 0. 

Suppose C is such a unit clause and ~x is the literal of C that is not set to any value yet (i.e. the 
variable x is unassigned yet). This clause can be satisfied only by assigning 0 to the variable x. 
This assignment is called deduced from clause C. After making the deduced assignment (e.g., x 
= 0), the procedure of setting literals to 0 (e.g., the positive literals of x) is performed again. 

25 This may lead to producing new unit clauses. 
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[0028] The BCP procedure is performed until (1) a solution is found, (2) all clauses of the 
current CNF formula contain at least two literals, or (3) an unsatisfiable clause, i.e. a clause 
with all literals set to 0, is produced (this situation is called a conflict). Before producing an 
unsatisfiable clause , two unit clauses having the opposite literals of the same unassigned 
5 variable appear, for example, z and ~z. Deducing the value of z from either clause makes the 
other clause unsatisfiable. 

[0029] In one embodiment, the size (or length) of each clause, i.e., the number of unassigned 
variables in each clause, is monitored. For example, each time a literal of a clause is set to 0, the 
counter keeping track of the size of this clause is updated. In another embodiment, rather than 
10 constantly monitoring the size of each clause, two watch literals are selected for each clause. 
The size of a clause C is only checked when one of the watch literals is assigned a value. If the 
assigned value is 1, then the clause is satisfied. If the assigned value is 0 and C is not satisfied 
yet, then a new watch literal is looked for. 

CONFLICT CLAUSE GENERATION AND BACKTRACKING 

15 [0030] Suppose that BCP has been completed and no solution is found in the current node. If 
no conflict is encountered, then the algorithm moves to the next node. If there is a conflict, the 
algorithm backtracks. The best way to describe backtracking is to use the notion of a level of 
assignments. A level of assignments consists of one assignment to a branching variable and all 
the assignments deduced in the following BCP. (In terms of search trees, a level of assignments 

20 consists of all the assignments made at a node of the search tree. For this reason, below, we use 
terms "node" and "level" interchangeably.) All levels are numbered by consecutive numbers, 
the root node of the search tree having the level value equal to 1. Backtracking may be 
chronological (when only the assignments of the current level are erased and the algorithm 
moves to the previous level) or non-chronological (when assignments of more than one 

25 consecutive levels are erased before the algorithm stops backtracking) . Both types of 
backtracking are explained by the example below. 
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[0031] Suppose a CNF formula F consists of the set of clauses of the initial CNF formula F* and 
implicates of the initial formula F* deduced during search. Let C be a clause of F. Suppose that 
a conflict is caused by clause C = -a + x + ~c ('+' denotes logical OR) being unsatisfiable. That is, 
the variables a, x, c have been assigned a = 1, x = 0, c = 1, which resulted in setting all three 
5 literals of C to 0. A set R of value assignments to variables of F is called an assignment leading 
to a conflict on C (or a conflict assignment for short) if, after making the assignment from R and 
running the BCP procedure, clause C becomes unsatisfiable. The clause of I R I literals that are 
all set to 0 by assignments of R is called the conflict clause recording conflict assignment R (or 
just a conflict clause for brevity). 

10 [0032] A trivial assignment making clause C = ~a + x + ~c unsatisfiable is the set {a = 1, x = 0, 
c=l}. However, this conflict assignment is of no interest since the corresponding conflict clause 
is already in the formula. Suppose, that R = {x = 0, y = 1, z = 1} is also an assignment leading to a 
conflict on C. Then clause C = x + ~y + ~z specified by set R is an implicate of the current F and 
can be added to F. All literals of the clause C are set to 0 by assignments from R = [x = 0, y = 1, 

15 z= 1} and so C is a conflict clause. 

[0033] Suppose that assignment x = 0 of R = {x = 0, y = 1, z = 1} is made at the current level. 
Additionally, assume that assignments y - 1 and z = 1 were made at levels v' and 
respectively. Suppose also that v > v' > v" where m > m means that level m is closer to the root 
level than level m. After adding conflict clause x + ~y + ~z to the current F, it is possible to 

20 backtrack from level v to level v' (erasing all the assignments made at the level v and all the 
levels between v and v' if any). The CNF formula Ft/ at level v' is different from the one 
obtained at level v' before because the clause x + ~y + ~z has been added to the clause database. 
Since variables y and z are assigned value 1 at levels v' and v", then at level v' clause x + ~y + ~z 
of Ft/ is unit and so assignment x = 1 can be deduced from it. Before adding clause x + ~y + ~z, 

25 this deduction was not possible. If v > v'+l, then there is at least one level of assignments 
between v and v\ The adding of this clause to the current formula allows the algorithm to 
back-jump (i.e., non-chronologically backtrack) to level v' skipping at least one level. However, 
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if v=v'+l, then the algorithm backtracks chronologically, namely, it backtracks to the previous 
level v'. 

[0034] For conflict analysis, suppose that at the current node v clause -a + x + ~c becomes 
unsatisfiable. To build a conflict clause corresponding to this conflict, a conflict assignment R 
5 that contains only one assignment made at node v needs to be found. This may be an 

assignment made to the branching variable of node v or an assignment deduced in the BCP 
procedure performed at node v. The rest of the assignments of R are made at nodes located on 
the path of the search tree from the root node to the node v. 

[0035] Such a conflict assignment can be constructed from assignment Ri = {a = 1, x = 0, c = 1} 
10 (that is the conflict assignment recorded by the clause ~a+ x + ~c) by performing the BCP 

procedure "backwards' 7 . For example, suppose that all the value assignments of Ri were made 
at node v and assignment a = 1 was deduced from clause a + x + ~z (due to assignments x = 0, z = 
1 made before). Now Ri can be replaced with a conflict assignment R2 = {x = 0, c = 1, z = 1}. This 
conflict assignment is obtained from Ri by replacing assignment a = 1 with x = 0, z = 1. Suppose 
15 that assignment z = 1 was deduced at node v' and v' < v. Then conflict assignment Ri contains 
only two assignments (x = 0 and c = 1) that were made at node v. Proceeding with the "reverse" 
BCP procedure we will eventually produce a conflict assignment containing only one 
assignment made at node v. Suppose, for example, that assignment c = 1 was deduced from 
clause c + ~y + ~z and assignment y = 1 was made at node v" , where v" < v. By replacing c = 1 
20 with y = 1, z = 1 in Ri we obtain conflict assignment R3 = {x = 0, y = 1, z = 1}. R3 satisfies the 
requirement above, because only one assignment of R3, namely x= 0, is made at node v. The 
conflict clause corresponding to R3 is x + ~y + ~z. 

[0036] Formally, the deduction of clause x + ~y + ~z can be described by the following chain of 
resolutions: resolve((~a + jc + ~c),(a + x + ~z)) x + ~c + ~z, resolve((x + ~c + ~z),(c + ~y +~z)) -> 
25 x+~y+ ~z. Only the final result of this chain (i.e., clause x + -y + ~z) is added to the current CNF 
formula. Conflict clauses corresponding to intermediate conflict assignments (e.g., x + ~c + ~z) 
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are thrown away. Clauses of the current F that are used in the deduction of the final conflict 
clause are called clauses responsible for the conflict. In the example, clauses ~a + x + ~c, a + x + 
~z, and c + ~y +~z are responsible for the conflict. Clauses responsible for a conflict are identified 
during the reverse BCP procedure and used to construct the conflict clause to be added to the 
5 clause database. 



VARIABLE SELECTION 

[0037] In one embodiment, each variable is assigned a counter (e.g., ac(z)) that stores the 
number of clauses, responsible for at least one conflict, that have a literal of z. The value of ac(z) 
10 is updated during the reverse BCP procedure. As soon as a new clause responsible for the 

current conflict is encountered, the counters of the variables, whose literals are in this clause, are 
incremented by 1. In another embodiment, the values of all counters are periodically divided 
by a small constant greater than 1, e.g., 4. Periodically dividing counters by a small constant 
decreases the influence of aged clauses and gives preference to recently deduced clauses. 

15 [0038] The clause database may be organized as a stack. Clauses of the initial CNF formula are 
located at the bottom of the stack and each new conflict clause is added to the top of the stack. 
In the process of assigning values, some clauses of the stack are satisfied. The unsatisfied clause 
at the top of the stack will be referred to as the current top clause. If the current top clause is a 
conflict clause, then it is the most recently deduced conflict clause that has yet to be satisfied. 

20 [0039] Suppose C is the current top clause. In one embodiment, the next branching variable is 
selected from among free (unassigned) variables of C when C is a conflict clause. Among the 
variables of C, a variable z with the largest value of ac(z) is selected. Variable z may look 
passive in the set of all the currently free variables. However, the activity of these variables 
may be the result of a very different set of conflicts that happened before deducing C or will 

25 have occurred after the deduction of C. These conflicts may involve different sets of variables. 
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Choosing z takes into account the fact that these active variables may be irrelevant to the 
conflict which had led to the deduction of C and to similar conflicts, i.e., the ones involving sets 
of variables that are close to that of C. 

[0040] Let ~z be the literal of variable z that is in the top clause C. Suppose that z is chosen as 
5 the next branching variable. In one embodiment, C is not immediately satisfied by assigning 
z=0. If C is not satisfied by the chosen assignment to z and no conflict is produced during the 
following BCP procedure, C remains the current top clause. Then a new free variable is selected 
from the ones whose literals are in C. Hence, eventually either C will be satisfied by an 
assignment to a branching variable (or by a deduced assignment) or after removing from C all 
10 literals but one it will be satisfied during the BCP procedure. 

[0041] If the current top clause is a clause of the original formula, then all the unsatisfied 
clauses in the stack are also clauses of the original formula. In an embodiment, the most active 
free variable of the current formula (i.e., free variable z with the greatest value of ac(z)) is 
selected. 

15 BRANCH SELECTION 

[0042] When the current top clause is a conflict clause, one of the goals in the selection of a first 
assignment value to a variable x is to maintain uniform distribution of positive and negative 
literals in conflict clauses of the database. Thus, for each literal Z, cost function lit_ac(l) is 
computed, litjic(l) gives the number of conflict clauses generated so far that contain literal Z. 
20 Initially litjtc(l) is equal to 0. As soon as a conflict clause C is added to the database, for each 
literal Z of C, the value of lit_ac(l) is incremented by 1. In one embodiment, the value of litjicd) 
is not divided by a constant. In another embodiment, the value of lit_ac(l) is not recomputed 
after some conflict clauses are removed from the database. 

[0043] If x is the next branching variable, the literal Z e {x, ~x] with the largest value of litjic(l) is 
25 selected. When lit_ac(x) = lit_ac(~x), literal Z, Z e [x, ~x), is selected at random and x is assigned 
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the value setting the chosen literal I to 1. Hence, clauses having literal I become satisfied and no 
conflict clause can contain 1. On the other hand, the opposite literal (i.e., -/) is set to 0 and so 
conflict clauses may include Thus, setting the literal with the largest value of lit_ac(l) to 1 
reduces the gap between the number of occurrences of x and ~x in conflict clauses of the clause 
5 database. 

[0044] When the current top clause is a clause of the original formula, a cost function nbjtwo(l) 
may be computed for each literal /, which approximates the number of binary clauses in the 
"neighborhood" of literal L A clause is binary if it contains only two literals. Function nbjwo(l) 
is computed as follows. First, the number of all binary clauses containing literal / is calculated. 
10 Then, for each binary clause C containing literal Z, the number of binary clauses containing 

literal ~v is computed where v is the other literal of C. The sum of all computed numbers gives 
the value of nbjzvo(l). This cost function can be considered as an estimate of the power of BCP 
performed after setting / to 0. The greater the value of nbjwo(l) is, the more assignments will be 
deduced from the binary clauses containing literal ~v after setting literal / to 0 in clause C. 

15 [0045] When nbjwo(x) = nb_two(~x), I e {x, ~x] is chosen at random. Variable x is then assigned 
the value setting literal / to 0. To reduce the amount of time spent on computing nbjwo(l), a 
threshold value may be used, for example 100. As soon as the value of nb_two(l) exceeds the 
threshold, its computation is stopped. 

CONFLICT CLAUSE DATABASE MANAGEMENT 

20 [0046] Before starting the next iteration (i.e., building a new search tree), some clauses may be 
physically removed from the database. This allows for reduction of memory used for database 
allocation. In the process of removing clauses, data structures may be partially or completely 
recomputed to fit them into smaller memory blocks. 

[0047] A fraction of clauses may be removed automatically due to retaining some value 
25 assignments deduced in the last iteration. Namely, the value assignments that were deduced 
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from unit conflict clauses (if any) and triggered by assignments deduced from unit conflict 
clauses in the BCP procedure are retained in the new iteration. Clauses that are satisfied by the 
retained assignments are removed from the current formula. 

[0048] The rest of the clauses to be removed may be selected using heuristics described below. 
5 One of the hypothesis for the approach described below is that more recently deduced clauses 
are more valuable because it took more time to deduce them from the original set of clauses. 

[0049] From the view point of clause removal heuristics, the set of conflict clauses is a queue. 
New conflict clauses are added to the tail of this queue while clauses to be considered for 
removal are located at its head. In one embodiment, clauses with more than 8 literals are 

10 removed from the first 1716 th part of the queue starting from the head. Clauses containing more 
than 42 literals are removed from the remaining 15/16 th part of the queue. In another 
embodiment, the last conflict clause is kept regardless of how many literals it has. In a further 
embodiment, a fraction of active clauses are kept regardless of how many literals they have. 
The activity of a clause C may be measured by the number of conflicts C has been responsible 

15 for thus far. In one embodiment, clauses at the head of the queue are considered to be active 
when their activity value is greater than 60. For the rest of the queue, active clauses are ones 
with activity values greater than 7. In another embodiment, the threshold value of the activity 
for clauses at the head of the queue has a starting value of 60 and is increased every 1024 nodes 
of the search tree. Thus, large clauses that are not used in any further conflicts will be removed 

20 from the database. 

[0050] In one embodiment, the current search tree is abandoned after generating 550 conflict 
clauses that are added to the tail of the queue. In another embodiment, before starting a new 
search tree, clauses are removed in order to get rid of at least 1/16 of the conflict clauses in the 
queue. If after applying the rules described above, the number of removed clauses is less than 
25 1/16 of the queue, the threshold on the size of clauses removed from the head of the queue may 
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be decreased by one literal in one embodiment. In one embodiment, once the threshold value 
reaches 4, it is not decreased any further. 

[0051] Looping may occur as a result of removing one or more clauses from the queue, i.e., it is 
possible that a set of clauses that has been removed is deduced again. One way of eliminating 
5 the possibility of looping is to mark one conflict clause deduced since the last restart as non- 
removable, unless it is satisfied by an assignment retained from the previous iteration. This 
guarantees that the number of marked clauses in the database grows monotonically, and 
therefore no looping is possible. The number of marked clauses to be kept can be reduced n 
times by marking a clause only after performing n iterations, i.e., after n restarts. 

10 ILLUSTRATIVE EXAMPLE 

[0052] One example of solving a formula F is set forth below: 

[0053] Suppose that a formula F to be tested for satisfiability consists of the following set of 
clauses (symbol means negation). 



[0054] Initially the stack of clauses is empty. Suppose that the first assignment made is a - 1. 
25 (This assignment is called a decision assignment in contrast to an assignment made to satisfy a 
unit clause. This assignment may also be called deduced). A node m of the search tree is 
created and labeled with variable a. There are two branches (labeled a = 1 and a - 0) coming out 
of this node. Since a = 1 is the first assignment, the branch labeled a = 1 is examined first. After 
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F = { 

Ci = a + b + c, 
Ci = ~a + d, 
C3 = ~b + c, 
C4 = ~d + e, 
Cs = ~e + b + ~c, 
Ce = m + ~d +f, 
C7 = ~m + w, 
C8 = ~m + ~zv + -a 
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this assignment is made, clause Ci is satisfied. Literal ~a is then set to 0 and removed from 
clauses Ci and Cs. Clause C2 then becomes a unit clause as it now only contains literal d. Since 
unit clauses have appeared in the current formula, the BCP procedure is invoked. For the sake 
of clarity it is assumed that BCP is invoked "inside" the node m. Before invoking the BCP 
5 procedure, formula F is equal to 

& = d, 
C?> = ~b + c, 
Q = ~d + e, 
Q = ~ e + \) + ~c, 

10 Ce = m + ~d +f, 

C7 = ~m + w, 
Cs = ~m + ~w 

[0055] The BCP procedure is started by making assignment d = 1 as this is the only way clause 
C2 can be satisfied. As mentioned above, this assignment is called a deduced assignment. After 
15 assigning d = 1, the clause C2 is satisfied while the literal ~d is set to 0 and removed from G and 
Ce. Ca then becomes a unit clause that can only be satisfied by assigning 1 to e. After making 
this assignment, clause Ca is satisfied and literal ~e is removed from Cs. At this point, BCP stops 
because there are no unit clauses left. 

[0056] The current formula (denote it by F(m)) that is obtained by making decision assignment 
20 a = 1 followed by the BCP procedure is now equal to: 

F(ni) = { 
C3 = ~b + c, 
Cs = b + ~c, 
C6 = m +/, 
25 C7 = ~ra + zi;, 

Cs = ~m + ~zf 
} 

[0057] Suppose a new decision assignment m = 1 is made. A new node 7x2, labeled with m, of 
the search tree is created. The branch labeled with m = 1 is then examined. After making this 
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assignment, clause G> is satisfied while clauses O and Cs become unit clauses. The formula is 
now equal to 

Cs = ~b + c, 
Cs = b + ~c, 
5 C7 = w, 

Cs = ~w 

[0058] Since unit clauses (C7 and Cs) appeared in the formula, the BCP procedure can be 
invoked. Suppose BCP is started by satisfying clause Ci , that is, assignment w = 1 is made. 
With that assignment, literal ~w equals 0 and is removed from Cs, which makes this clause 

10 unsatisfiable because it does not contain any literals. This situation is called a conflict because 
in order to satisfy both Ci and Cs, opposite values have to be assigned to variable w. At this 
point a conflict clause is deduced and the procedure backtracks. The deduced conflict clause is 
C9 = ~a + ~m. This clause specifies the fact that making assignments a = 1 and m = 1 leads to a 
conflict. That is, there is no solution (i.e., a value assignment satisfying all the clauses) in which 

15 a = 1 and m = 1. The clause -a + ~m is the first clause to be added to the stack. 

[0059] When the procedure backtracks, it returns to the node where the last decision 
assignment was made, i.e., the procedure returns to node m where the last decision assignment 
m = 1 was made. Note that the formula at the node m, denote it by F*(rci), is 

PN = I 

20 Cs = ~b + c, 

Cs = b + ~c f 

Ce = m +/, 

C7 = ~m + w, 

Cs = ~m + ~w 
25 Cs = ~m 

} 

different from F(m) because conflict clause C9 has now been added. This clause is equal to ~m at 
the node rci because at the root node of the search tree, an assignment a = 1 had been made. 
Hence, literal -a has been removed from clause C9, which makes it a unit clause. Since the 
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current formula contains a unit clause, BCP is invoked. C9 is satisfied by making assignment m 
= 0, which satisfies clauses O? and Cs as well. Literal m is removed from Ce, which makes Ce a 
unit clause with literal/. By setting /to 1, Ce is satisfied. 

[00601 At this point only two clauses (C3= ~b + c and Cs = b + ~c) are left. Therefore, BCP stops 
5 as there are no more unit clauses. Suppose assignment b = 0 is made. The assignment satisfies 
clause C3 and removes literal b from clause Cs. Cs then becomes a unit clause with literal ~c. 
BCP is invoked and c is assigned value 0. At this point all of the clauses of the formula are 
satisfied and a solution has been found. The solution includes the decisions and deduced 
assignments made down the path leading to the last, i.e. current, node, the decision assignment 
10 made at this last node, and the deduced assignments derived after making this last decision 
assignment. These assignments are a = 1, d = 1, e = 1, m = 0,/= 1, b = 0, c = 0. 

SYSTEM ARCHITECTURE OVERVIEW 

[0061] Fig. 2 is a block diagram of a computer system 200 suitable for implementing an 
embodiment of the present invention. Computer system 200 includes a bus 202 or other 
15 communication mechanism for communicating information, which interconnects subsystems 
and devices, such as processor 204, system memory 206 (e.g., RAM), static storage device 208 
(e.g., ROM), disk drive 210 (e.g., magnetic or optical), communication interface 212 (e.g., modem 
or ethemet card), display 214 (e.g., CRT or LCD), input device 216 (e.g., keyboard), and cursor 
control 218 (e.g., mouse or trackball). 

20 [0062] According to one embodiment of the invention, computer system 200 performs specific 
operations by processor 204 executing one or more sequences of one or more instructions 
contained in system memory 206. Such instructions may be read into system memory 206 from 
another computer readable medium, such as static storage device 208 or disk drive 210. In 
alternative embodiments, hard-wired circuitry may be used in place of or in combination with 

25 software instructions to implement the invention. 
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[0063] The term "computer readable medium" as used herein refers to any medium that 
participates in providing instructions to processor 204 for execution. Such a medium may take 
many forms, including but not limited to, non-volatile media, volatile media, and transmission 
media. Non-volatile media includes, for example, optical or magnetic disks, such as disk drive 
5 210. Volatile media includes dynamic memory, such as system memory 206. Transmission 
media includes coaxial cables, copper wire, and fiber optics, including wires that comprise bus 
202. Transmission media can also take the form of acoustic or light waves, such as those 
generated during radio wave and infrared data communications. 

[0064] Common forms of computer readable media includes, for example, floppy disk, flexible 
10 disk, hard disk, magnetic tape, any other magnetic medium, CD-ROM, any other optical 

medium, punch cards, paper tape, any other physical medium with patterns of holes, RAM, 
PROM, EPROM, FLASH-EPROM, any other memory chip or cartridge, carrier wave, or any 
other medium from which a computer can read. 

[0065] In an embodiment of the invention, execution of the sequences of instructions to practice 
15 the invention is performed by a single computer system 200. According to other embodiments 
of the invention, two or more computer systems 200 coupled by communication link 220 (e.g., 
LAN, PTSN, or wireless network) may perform the sequence of instructions required to practice 
the invention in coordination with one another. 

[0066] Computer system 200 may transmit and receive messages, data, and instructions, 
20 including program, i.e., application code, through communication link 220 and communication 
interface 212. Received program code may be executed by processor 204 as it is received, and/or 
stored in disk drive 210, or other non-volatile storage for later execution. 

[0067] In the foregoing specification, the invention has been described with reference to specific 
embodiments thereof. It will, however, be evident that various modifications and changes may 
25 be made thereto without departing from the broader spirit and scope of the invention. For 
example, the above-described process flows are described with reference to a particular 
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ordering of process actions. However, the ordering of many of the described process actions 
may be changed without affecting the scope or operation of the invention. The specification 
and drawings are, accordingly, to be regarded in an illustrative rather than restrictive sense. 
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